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Parton Distributions: Progress and Challenges 
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We briefly discuss recent research on the spin-averaged parton densities of the proton, 
focusing on some aspects relevant to hard processes at the LHC. Specifically, after 
recalling the basic framework and the need for higher-order calculations, we address 
the evolution equations governing the scale dependence of the parton distributions and 
their solution, schemes for initial conditions and the inclusion of heavy quarks, recent 
progress on fits to data, and future high-precision constraints from LHC measurements. 

1 Introduction: partons for the LHC 

For at least the next ten years, proton -(anti-) proton colliders will continue to form the 
high-energy frontier in particle physics. At such machines, many quantitative studies of hard 
(high mass/scale) standard- model and new-physics processes require a precise understanding 
of the parton structure of the proton. The present talk [1] briefly discusses some recent 
developments in this field. 

We start by recalling the description of hard 
proton processes using the simplest case, in- ^ 
elusive lepton-proton dcep-inclastic scattering 
(DIS), the process providing the major part of 
the present constraints on the parton densities. 



Here the hard scale is the virtuality Q 



of 



the exchanged gauge boson, a photon in Fig. [U 
and the Bjorken variable x = / {2Pq) , with P 
the proton momentum, is usually chosen as the 
second independent variable. At zcroth order in 
the strong coupling constant ag the hard coeffi- 
cient functions Ca,i are trivial, and the momen- 
tum fraction ^ carried by the struck quark i is 
equal to Bjorken- a; if mass effects are neglected. 
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Figure 1 : Kinematics of photon-exchange 
DIS in the QCD-improved parton model. 
Particle momenta are given in brackets. 



In general, the structure functions Ffr for the process of Fig. [T] are given by 



E 



(1) 



plus terms of order l/Q^ which, for the purpose of high-scale predictions, are best suppressed 
by sufficiently stringent cuts on the fitted experimental data. Besides on the factorization 
scheme used to define the parton densities /f - in this talk MS unless stated otherwise - 
the coefficient functions depend on the renormalization and factorization scale /i (identified 
here for notational simplicity) which ought to be chosen as = 0{Q^) in order to avoid 
large logarithms. The parton distributions depend on this scale via the evolution equations 
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Here (g) is a short-hand for the MeUin convolution written out in Eq. ([I]) above. The initial 
conditions for Eq. ^ are, of course, not calculable in perturbative QCD. As lattice results 
are restricted to very few Mellin moments (with, at present, still rather limited accuracy), 
predictions for collider cross sections are obtained via fits to suitable sets of reference ob- 
servables, including structure functions in DIS, and the universality of the parton densities. 

The splitting functions P and the process-dependent hard coefficient functions Ca admit 
expansions in powers of ckg, 



P 



a. 



(3) 



with, for example, tIq = for F2 and Ua = 1 for F^. For a consistent approximation the 
same number of terms has to be kept in the two lines of Eq. Q. The first n+1 terms define 
the N"LO approximation. As the normalization of the LO prediction is rather arbitrary, 
the next-to-lcading order (NLO) provides the first real prediction of the cross sections and, 
consequently, the NNLO the first serious error estimate of the perturbative expansions. 

The successive approximations of perturbative QCD arc illustrated in Fig.[2]for an LHC 
process of utmost importance, the production of the standard-model Higgs boson domi- 
nated by gluon-gluon fusion via a top-quark loop. Obviously the NLO approximation [2] is 
insufficient for a quantitative prediction in this case, and even at NNLO [3,4] higher-order 
uncertainties of about 15% remain for the total cross section. A perturbative accuracy of 5% 
is only reached at N'^LO, known to a sufficient approximation from Ref. [5] (for an extension 
to the rapidity distribution sec Ref. [6]). Note that these uncertainties do not include those 
of the coupling as and the parton densities, taken for Fig. [2] from Ref. [7] where at NNLO 
previous (but sufficiently accurate) approximations [8] were used for P'^' in Eq. jS]). 
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Figure 2: Perturbative expansion of the total cross section for Higgs production at the LHC. 
Shown are the dependence on the mass Mh and the renormalization scale ^.^ (from Ref. [5]). 
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The minimal momentum fractions 
^_ of partons contributing to the pro- 
duction of a particle of mass M at 
the LHC are shown in Fig. [Sj together 
with the kinematic reach of HERA and 
fixed-target DIS experiments at the 
corresponding scales Q^. Taking into 
account also the limited rapidity cov- 
erage of the LHC detectors, one reads 
off > 10^"* for the most important 
processes, including the production of 
the W, Z and Higgs bosons and the top 
quark, and the search for new particles. 
Thus the HERA data can be fitted with 
a cut of Q2 10 GeV^ which should 
be sufficient to suppress low-scale in- 
stabilities (as, e.g., in Fl to NNLO [9]) 
and power corrections to Eq. ([1]). 
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Figure 3: Minimal parton momenta ^_ probed 
at the LHC, compared with the DIS coverage of 
HERA and previous fixed-target experiments. 



2 Higher orders in the parton evolution 

The complete NNLO splitting functions P^'^\x) - from now on we, as usual, denote also 
the parton momentum fractions by x - in Eq. ([3]) have been computed three years ago 
in Refs. [10,11]. We first consider the fiavour non-singlet evolution of quark-distribution 
differences such as the combination g+ — u + u ~ {d + d) probed by F2 — F^. Figure |4] 
illustrates the perturbative expansion of the Mellin moments of the corresponding splitting 
function and the resulting approximations for the scale dependence of q^^ at large x. 



d In q / d In [J,^ 




Figure 4: The LO, NLO and NNLO approximations to the splitting-function moments 
P^{N) for four flavours at ctg = 0.2, and the resulting logarithmic scale derivatives for 



0,0.5 



(1 — x) , a schematic but characteristic model distribution (from Rcf. [10]). 
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The first fourth-order resuh for this sphtting function lias been presented last year [12]: 
the second moment of P^^ix) is now known to N'^LO for three flavours, numerically reading 

P+{N^2,nj^3) = -0.283 as [1 + 0.869 as + 0.798 a^ + 0.926 a^ + ...]. (4) 

Taking into account the weak A^-dcpendcncc of at N >2 demonstrated in Fig. [H this 
result sets the scale for the N'^LO contributions for the whole large-x region. According to 
the general pattern, the corresponding corrections for rij = 4 ... 6 will be even smaller. 

The low-x behaviour of the non-singlet splitting functions and coefficient functions is not 
too relevant in practice, but provides an interesting lab for the study of small- a; logarithms: 
unlike in the singlet case, two additional powers of In a; enter per order in ag, e.g., terms up 
to In'^ X and In^ x occur in P^s and C2 „s already at order ag . Successive approximations of 
these functions including the leading, next-to-leading, . . . small-x terms are shown in Fig. [H 
Obviously, a 'low-order' approximation of this type is not appropriate at any cc-values rele- 
vant to colliders. Consequently leading- and next-to-leading-log rcsummations can, at best, 
provide only very rough indications of the maximal size of the higher-order corrections. In 
the present case the all-order leading-logarithmic contributions [13] are small enough to 



exclude small-x instabilities, e.g., for xq^ 



,0.5 



down to extremely low values of x [14]. 
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Figure 5: The exact a^ contributions to the non-singlet splitting function and coefficient 
function for F2, compared to approximations obtained from the small-x logarithms (from 
Refs. [10, 15]). The leading small- a; term of P^s+ ^''^^ derived before in Ref. [13]. 

The flavour-singlet splitting functions are vital for transferring small- a; information from 
HERA to LHC scales across up to three orders of mag nitude in rccaU Fig. [3l The 
corresponding NNLO contribution [11] to Eq. ([2]) is shown in Fig.[6]for the gluon-gluon case. 
Also here the leading small- a; term, obtained before in Ref. [16] (and transformed to MS in 
Ref. [17]), does not provide a good approximation for any practically relevant values of a;. 
Moreover, the splitting functions enter physical quantities only via the Mellin convolutions 
of Eqs. ^ and ([2]). Hence a locally accurate low- a; approximation, as provided for Pgg^ by 
the a:~^lna; plus the x~^ terms at a; < 10"'^, is insufficient for dg/dlnji^ even at small x. 
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Figure 6: Left: the contribution Pgg to the gluon-gluon splitting function, compared to 
its leading [16, 17] and next-to-leading small- a; approximations. Right: the convolution of 
these three functions with a schematic but typical gluon distribution (from Rcf. [11]). 

Consequently, reliable estimates of the post-NNLO corrections to the small- a; evolution 
will become possible, via approximations analogous to those of Ref. [17], only once a few 
singlet moments have been computed to order a*. Fortunately, as illustrated in Fig. [71 the 
expansion of the quark and gluon evolution to NNLO appears to be very stable, at least for 
the main HERA-to-LHC region x > 10^^ at > iq Q^y^ (ggg above). 
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Figure 7: Perturbative expansion of the scale derivatives of typical quark and gluon distri- 
butions at ji^ « 30 GcV^ (from Ref. [11], where the initial conditions arc specified). 
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3 Solutions of the evolution equations 

The most direct manner to solve the system ^ of coupled integro-differential equations is 
by a discretization in both x and ji^. Recently written or updated public codes including the 
NNLO evolution are HOPPET [18] and the new version 17 (beta-released at the time of this 
talk) of QCDNUM [19]. Alternatively, Eqs. ^ can be transformed to ordinary differential 
equations in (complex) Mellin-A^ space. These are then treated analytically and the solutions 
transformed back by quadratures. This approach has been employed in QCD-Pegasus [20]. 
The left part of Fig. [5] shows a sample comparison of the programs [18] and [20], using the 
Les Houches initial conditions discussed below. The right part of the figure, taken from the 
QCDNUM manual, illustrates the greatly improved numerical accuracy of the new version. 
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Figure 8: Left: ratios of high-scale NNLO up- valence and gluon distributions after evolution 
with the codes [18] (S) and [20] (V). Right: the accuracy improvement of QCDNUM due to 
the new quadratic x-interpolation with actually fewer points (for details see Ref. [19]). 

Obviously it is very useful, e.g., for validating newly written or ported codes, to have at 
one's disposal a set of benchmark evolution results. A reference input was set up for this at 
the 2001 Les Houches collider-physics workshop (see Ref. [21] for the complete expressions), 

xu^{x,^J:lo) = 5.1072 a;°-^ ~ xf , ■ ■ ■ , xg{x, fil„) = 1.7000 x""'^ (1 - x)^ (5) 

for the initial factorization scale q = 2 GcV^ and the coupling Q!s(m? = 2 GcV^) = 0.35 . 
As illustrated in Fig. [H the results of the programs [18] and [20] agree to five significant 
digits over a wide range in x and , a level of agreement not reached before between x-space 
and iV-space programs. The results at the important scale ~ 10'* GeV^ have therefore 
been cast into reference tables for the evolution at LO, NLO - including, for different initial 
conditions, the spin-dependent case - and NNLO for the scales ratios ^r/^f = 0.5, 1 and 
2, using both a fixed and a variable number of flavours (see below). For example, the 
(iterated, see Ref. [20]) four-flavour NNLO evolution of Eq. ([5]) for jif = 2/ir yields 

X = 10"^ , xu„ = 2.9032 • 10"^ , . . . , xg ^ 2.2307 • 10^ 

X = 0.9 , xu„ = 3.6527- 10""* , = 1.2489 • 10"^ (6) 

at this scale. The complete tables can be found in Refs. [21]. It would be very reasonable 
to employ only programs which have been checked against these benchmarks. 
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4 Input shapes, factorization schemes and heavy quarks 

The MS scheme adopted so far is calculationaUy convenient and leads to a perturbatively 
stable parton evolution - recall Eq. ([4]), Figs. |4] and [T] However, the NLO, NNLO, ... 
parton distributions are not physical in this scheme. Therefore MS may not be the scheme 
in which the initial distributions retain their physically motivated shapes (as long known 
for the photon structure [22]), e.g., for the proton's gluon density at large x, see Ref. [23]. 
Moreover, it seems unclear which positivity boimd in particular g(x,yu^) has to obey in 
this scheme, and which NLO partons (if any) are best suited for obtaining estimates from 
leading-order Monte-Carlo programs [24]. 

The traditional alternative to MS has been the DIS scheme [25], in which the quark 
distributions are rendered physical via the structure function i^2- For the singlet sector the 
transformation to this scheme is given by 

Qs'"^ = Qs + as[ci']j qs + 4]l ® g] + . . . 

gDis = .9 - as[4!j®gs + 4!1®5] + •■• • (7) 

Its large drawback is that the second row of Eq. ([7]) is arbitrary except for the moment N ~ 2 
fixed by the momentum sum rule. Thus there is nothing physical about the DIS-schemc 
gluon density especially where constraints are needed most, for very large and for small x. 

This shortcoming is absent in an interesting old suggestion, the DIS^ scheme going 
back (at least) to Ref. [26]. Here also the shape of the gluon distribution is rendered 
physical via the structure function F^f, of a scalar directly coupling to gluons (such as the 
Higgs boson in the large-mtop effective theory). The transformation to DIS^ at N"LO 
requires the corresponding coefficient functions c^"^ and c^"^ . Scalar-exchange DIS had to be 
considered anyway in Ref. [11] , and the determination of these coefficient function to order 
requires only a minor extension of the published calculations. These functions and possible 
constraints arising, for example, from the positivity of i^^ will be presented elsewhere. 

Now we turn to heavy quarks. For processes at a sufficiently high scale, charm and 
bottom become effectively light flavours which have to be included in the parton structure 
of the proton. For most values of x one can disregard a possible non-perturbative 'intrinsic 
charm' (or bottom) component (which, however, can be relevant at large x for some specific 
LHC processes [27]). The MS evolution of as [28] and the parton densities with a variable 
number of flavours then proceed via a matching of effective theories for different n^. The 
matching conditions for the parton distributions are especially simple at the heavy-quark 
mass, = m^. Denoting the light-quark distributions by li, they up to N^^^LO read [29] 

9 (^''^^^ = 9 + S^, al [Ajf,) gf + J^^ ® g t^'')] 
{h + h) (^'+1) = 5ra2 al [^^f ) ® gf -f 4f ) ® g (~^)] . (8) 

The results [30] underlying the qq, hq and hg coefficients have been confirmed recently [31]. 

The matching conditions ([5]) are included in the above evolution codes and benchmarks. 
Note that the NNLO discontinuities were so far ignored in the (published) MRST parton 
densities. However, they have now been implemented and found to significantly affect the 
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cross sections for WjZ production at the LHC [32]. Forthcoming updates of also the NLO 
distributions will include further significant improvements, e.g., the use of fastNLO [33] for 
jet cross sections instead of pre-calculated /\-factor tables. 

In general, the calculation of heavy-quark effects on observables is far more involved. 
We briefly summarize this issue for charm production at HERA, a process which affects the 
vital extraction of the small- a; quark and gluon densities from F^. There are three regimes: 
For Q ^ mj, only u, d, s and g act as partons, and charm production can be calculated 
using the fixed-order massive coefficient functions, presently known to NLO [34]. This 
framework is usually referred to as the fixed- flavour number scheme (FFNS). At Q ;:gj> mj, 
all terms with rndQ are negligible, and = 4 partons - obtained via the matching con- 
ditions ([8]) - can be used with massless four- flavour coefficient functions, a procedure often 
called the zero- mass variable flavour-number scheme (ZM-VFNS). Finally there is, in gen- 
eral, an intermediate region Q ^ mc, where terms with rric/Q are not negligible, but large 
quasi-coUinear logarithms require a resummation via Eqs. ([2]). Then the Uj = A partons 
have to be used with 'interpolating' coefficient functions for which several prescriptions have 
been suggested, see refs. [29,35-37]. This is the genuine (or general-mass, GM-) VFNS. 

The transition regions between these regimes are process-dependent and tend to lie at 
higher scales than one might at first expect, something to be kept in mind when using 
bottom distributions with massless coefficient functions at the LHC. For example, there are 
strong experimental (see Fig. [9]) and theoretical (recall, e.g., Ref. [38]) indications that the 
FFNS is applicable for the small-x HERA data on at least up to > ^qO GeV^. 
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Figure 9: HERA measurements of the charm structure function F2 , compared to NLO 
CTEQ calculations in the fixed [39] and variable flavour-number [37] schemes. All results 
have been normalized to the former calculation (adapted from Rcf. [40]). 
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5 Recent parton analyses and future LHC constraints 

Recently the CTEQ collaboration has published a major update, CTEQ6.5, of their NLO 
global fits [37] . A salient improvement is that the mass suppression of the charm contribution 
to F2 at HERA has finally been included - before the inadequate ZM-VFNS (see above) had 
been used. The reduced charm component is compensated by larger u and d distributions 
at small x as illustrated in Fig. [TUl This increase leads to larger predictions for the W- and 
Z-production cross sections at the LHC, by about 8%, a shift well outside the uncertainty 
bands obtained from the previous CTEQ 6.f sets [41]. It should be noted, however, that both 
this shift and the similar NNLO result of Ref. [32] mentioned above do not invalidate the 
widths of the previous error bands. Rather the previous central values should be considered 
unreliable, as they resulted from fits disregarding well-known theoretical information. 




Figure 10: Central u and d distributions of the CTEQ 6.5 fit, normalized to previous results 
from the same group [41]. Also shown are the estimated error bands resulting from the 
experimental uncertainties of the data included in the analysis (from Ref. [37]). 

In any case, it is important to have at one's disposal several independent sets of parton 
distributions at each order of perturbative QCD. Until recently, the only NNLO analysis 
besides those of MRST (now MSTW) was that of Ref. [42] , based only on data from deep- 
inelastic scattering. Last year this analysis has been expanded in Ref. [43]: a consistent 
subset has been included of the available data on Drell-Yan lepton-pair production - note 
the difference in approach to CTEQ, who are working on their treatment of inconsistent 
data sets. The NNLO corrections to these cross sections [44, 45] are found to be crucial for 
the fits and, interestingly, as before a rather low value of as{Mz) is preferred, in marked 
contrast to the recent NNLO fits of MSTW [32]. 

Usually the initial conditions for Eq. ^ are written in terms of an ansatz, as in Eq. ([S]) 
but with more free parameters. The resulting bias is monitored by varying this functional 
forms as, e.g., in the two dashed curves in Fig. 1101 An alternative approach is pursued by 
the NNPDF collaboration, using neural networks to avoid any such bias. A first analysis of 
non-singlet structure functions has been performed in Ref. [46] , using a new hybrid evolution 
method combining advantages of the x-space and Mellin-A'^ techniques mentioned above. 
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Four uncertainty bands for the com- 
bination u + u— (d + d) of NLO quark 
distributions are displayed in Fig. 1111 
There are many differences between the 
chosen analyses of Refs. [7,41,42,46], 
thus it seems difficult to isolate the 
possible impact of the paramctrization 
bias. It would be interesting to see fits 
of a reference data set using different 
approaches to the initial conditions but 
otherwise identical conditions. In any 
case, given the precision of the data on 
the proton structure function F2 and 
the neutron-proton ratio, for example 
at X « 0.2, it seems rather unlikely that 
the very wide NNPDF band reflects the 
true uncertainty. 




Figure 11: Experimental error bands for the NLO 
non-singlet combination = u + u — (d + d), 
according to the older fits in Refs. [7,41,42] and 
the recent NNPDF analysis (from Ref. [46]). 



Finally a non-singlet analysis of electromagnetic DIS has been performed in Ref. [47], 
besides the quark distributions focusing on determinations of as up to the N'^LO of Eq. [3l 
This order is accessible outside the small- a; region since, as confirmed by Ref. [12], the 
N'^LO corrections to the structure function evolution are dominated by the known coeffi- 
cient functions, see Fig. 20 of Ref. [15]. The results of Ref. [47] for the strong coupling 
constant read as(A'fz) = 0.1134,41 ± 0.002 at N^'^LO, consistent with Ref. [43] but not 
with Ref. [32] . Obviously more research is required before firm conclusions can be drawn on 
the uncertainties of the parton densities (as in Fig.[Tl]) and the determination of as- 



pp ^ (Z,7*)+X 



pp ^ W+X 





Figure 12: Rapidity-dependent cross sections for gauge-boson production at the LHC, using 
the partons of Ref. [7] . Shown are the theoretical uncertainty estimates obtained by varying 
the scale fi by the arbitrary but conventional factor of two around M\y,z (from Ref. [44]). 

The pre-LHC determinations of the parton densities will be improved upon by including 
reference cross sections measured at the LHC. The 'gold-plated' process of gauge-boson 
production is illustrated in Fig. [T^l see Ref. [48] for a more detailed discussion including 
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experimental aspects. The results shown demonstrate the importance of NNLO results even 
for processes with a far more benign perturbativc expansion than the Higgs-production cross 
section of Fig. [2] It would clearly be impossible to make precision predictions, or perform 
precision analyses, based on the rough (and non-overlapping) LO and NLO error estimates 
obtained by varying the rcnormalization and factorization scale(s). Thanks to the NNLO 
calculations [44,45], on the other hand, the perturbativc uncertainty has been reduced to a 
level of about 1%, an accuracy unprecedented for hadron-coUider cross sections. 

6 Outlook: HERA results for the LHC era 

Precision parton densities and QCD cross sections are required to fully realize the potential 
of the LHC. For example, a very precise M^-mass determination with 5Mw J: 10 MeV seems 
experimentally feasible, see Ref. [49]. Combined with Smtop — 1 GeV such a result could 
help to discriminate between, e.g., the standard model and its minimal supersymmetric 
extension - see the figure (updated from Refs. [50]) shown at the end of the talk [1]. While 
great progress has been made during the past years, considerable challenges remain. 

At the time of this write-up 15 years of data-taking at HERA have ended. Its results 
will remain indispensable throughout the LHC era, and it is important that also the high- 
luminosity results of the last phase are fully exploited - despite the obvious temptation to 
move on to, say, LHC Higgs hunting as soon as possible. Moreover, it is highly desirable to 
preserve important data, e.g., on heavy quarks and jet production, in a manner facilitating 
detailed re-analyses (as performed for PETRA data in Rcf. [51]) a decade from now. 
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